🎯 Reinforcement Learning - tomas.burkert · Scour

Reinforcement Learning from Human Feedback

arxiv.org·1d

💬Prompt Engineering

Hybrid neural–cognitive models reveal how memory shapes human reward learning

nature.com·2d

🧠Cognitive Science

Main Content || Math ∩ Programming

jeremykun.com·12h

📊Data Science

Quantization-Aware Distillation

ternarysearch.blogspot.com·1d·

Discuss: Hacker News

On Computation and Reinforcement Learning

arxiv.org·3d

💬Prompt Engineering

Adaptive Neuro-Symbolic Planning for smart agriculture microgrid orchestration in hybrid quantum-classical pipelines

dev.to·1d·

Discuss: DEV

💬Prompt Engineering

🥇Top AI Papers of the Week

nlp.elvissaravia.com·20h

💬Prompt Engineering

Personalized Adaptive Feedback System for Early Detection and Intervention of Fine‑Motor Skill Development in Preschool Children Using Wearable IMU Sensors and Reinforcement Learning

freederia.com·3d

25W06. Learning a language with the machine

z1nz0l1n.com·1d

AI Agents as Accountability Partners: Configurable Nudging for Your Goals

blog.turtleand.com·15h·

Discuss: DEV

💬Prompt Engineering

Deep reinforcement learning-based energy scheduling for green buildings with stationary and EV batteries of heterogeneous characteristics

sciencedirect.com·2d

💬Prompt Engineering

Continual learning and the post monolith AI era

baseten.co·2d·

Discuss: Hacker News

💬Prompt Engineering

From Prediction to Compilation: A Manifesto for Intrinsically Reliable AI

news.ycombinator.com·23h·

Discuss: Hacker News

💬Prompt Engineering

i10e-lab/HelloRL: A fully modular framework to make Reinforcement Learning quick and easy

github.com·2d·

Discuss: Hacker News

💬Prompt Engineering

Part 5: Reward Engineering: How to Shape Behaviors in Financial/Robotic Tasks

dev.to·3d·

Discuss: DEV

💬Prompt Engineering

Why reinforcement learning breaks at scale, and how a new method fixes it

techxplore.com·4d

💬Prompt Engineering

Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time Obstacle Prediction **Abstra...

freederia.com·3d

💬Prompt Engineering

Performance Tip of the Week #94: Decision making in a data-imperfect world

abseil.io·1d

💬Prompt Engineering

(8) AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

arxiviq.substack.com

·32m·

Discuss: Substack

💬Prompt Engineering

Travel Recommendations of Tomorrow: Generative Artificial Intelligence and Travel Planning

onlinelibrary.wiley.com·1h

💬Prompt Engineering

Loading more...